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Abstract. The Lie-Trotter-Kato product formula was recently extended into 
Hadamard spaces by [Stojkovic, Adv. Calc. Var., 2012]. The aim of our short 
note is to give a simpler proof relying upon weak convergence. Unlike in the 
original proof, we completely avoid using ultrapowers of Hadamard spaces and 
any additional compactness assumptions. 



1. Introduction 

Let / : % — > (— oo, oo] be a convex lower semicontinuous function (lsc) defined 
on an Hadamard space (T-L,d). For instance % can be a Hilbert space and d its 
natural metric induced by the inner product. For the notation and terminology not 
explained here, the reader is referred to Section[5J Given A > 0, define the resolvent 
of / as 



(I) J\{x) := argmin 



f(y) + ±-d(x,y) 2 



The mapping J\ : % — > dom/ is well-defined for all A > 0; see [12J Lemma 2] 
and [5TJ Theorem 1.8]. We also put Jo(x) := x, for all x € W. The gradient flow 
semigroup of / is given as 

(n) 



(2) S t (x) := lim [J±\ (x), iGdom/, 



for any t <E [0, oo). The limit in ([2]) is uniform with respect to t on bounded subin- 
tervals of [0, oo), and (St) t>0 is a strongly continuous semigroup of nonexpansive 
mappings on H; see [15l Theorem 1.3.13], and [21] Theorem 1.13]. 



Remark 1.1. If TL is a Hilbert space, un £ dom/, and we put u(t) = St (wo) , for 
t £ [0, oo), we obtain a "classical" solution to the parabolic problem 

u(t) G -df («(t)) , te(0,oo), 

with the initial condition u(0) = uq. 
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In the present paper, we consider a function / : % — > (—00, 00] of the form 
(3) /:=£/;, 

where f 3 ■ : % — > (—00, 00] are convex lsc functions, 3 — 1, . . . , fc, and fc G N. This 
covers a surprisingly large spectrum of problems and has become a classical frame- 
work in applications; see for instance [7J Proposition 27.8] for the so-called parallel 
splitting algorithm. In the Hadamard space setting, functions of the form ([3]) nat- 
urally emerged in connection with the following example. 

Example 1.2. Given a finite number of points 01, . . . , ajy € W, and positive weights 
u>i, . . . , wn with XL=i w n = 1, we define the function 

JV 

f( x ) := £ w nd(x,a n ) p , xeH, 

n=l 

where p 6 [1, 00). Then / is convex continuous and we are especially interested in 
two important cases: 

(i) If p = 1, then / becomes the objective function in the Fermat- Weber 
problem for optimal facility location. If, moreover, all the weights w n = jj, 
a minimizer is called a median of the points a%, . . . , a/y. 

(ii) If p — 2, then a minimizer of / is the bary center of the probability measure 

N 
n=l 

where 5 an stands for the Dirac measure at the point a n . For further details 
on barycenters, the reader is referred to [HI Chapter 3] and [27] • If) 
moreover, all the weights w n = j^, the (unique) minimizer of / is called 
the Frechet mean of the points 01, . . . , ajy- 

Both medians and means of points in an Hadamard space are currently a subject 
of intensive research for their applications in computational biology; see [31 [5J [35] 
and the references therein. Another area where Frechet means play an important 
role are so-called consensus algorithms |10) . 

Having demonstrated the importance of functions of the form (J3j , we will now 
turn to a nonlinear version of the Lie-Trotter-Kato product formula, which states 
that the gradient flow semigroup of / can be approximated by the resolvents or 
semigroups of the individual functions fj, with j = 1, . . . , fc. We first need some 
notation. 

Notation 1.3. Let f : T~i — > {— 00,00] be a function of the form ([3J, and we of 

course assume that it is not identically equal 00. Its resolvent J\ and semigroup St 
are given by {1} and ([2]) 7 respectively. The resolvent of the function fj is denoted 
J\j f or each j = 1, . . .,k, and likewise the semigroup of fj is denoted S( , for each 

j = l,..., k. 

The symbol Pj will denote the metric projection onto dom/j, where 3 = 1, . . . , fc. 
If F : H — > H is a mapping, we denote its fc th power, with fc € N, by 

F {k) x := (Fo---oF)x, xeU, 
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where F appears fc-times on the right hand side. Having this notation at hand, we 
are able to state a nonlinear version of the Lie-Trotter-Kato product formula due 
to Stojkovic [Ml Theorems 4.4, 4.5, 4.8]. 

Theorem 1.4 (Stojkovic). Let (T~L, d) be an Hadamard space and f : TL — > (— oo, oo] 
be of the form ([3|). We use Notation \1.3[ Then, for any t > 0, and x G dom/, we 
have 

j\o---oj\) (x), 

n n J 



and, 



Si oP k ---oS\ oPA (x). 

n n J 

The convergence in both (|4a|) and (j4b[) is uniform with respect to t on any compact 
subinterval of [0, oo). 

The original proof in [Mj uses ultrapowers of Hadamard spaces or additional as- 
sumptions (local compactness of the underlying space). The aim of the present note 
is to simplify this proof by using the notion of weak convergence. Such simplification 
is believed to be desirable with regard to the importance of the Lie-Trotter-Kato 
formula. 

We shall finish this Introduction by recalling a brief development of the theory 
of gradient flows in Hadamard spaces, which has recently attracted considerable 
interest. It started independently by the work of Jost [T5] and Mayer [3T], when 
the existence of the gradient flow semigroup was established. The study of the 
relationship with the Mosco and T- convergences, initiated already in [15] . was 
treated in greater detail in [?J [TB] . In [B] , the author describes large time behavior of 
the gradient flow as well as its discrete version called the proximal point algorithm. 
As already mentioned above, the the Lie-Trotter-Kato product formula was proved 
in [24]. There have been also many related results in some special instances of 
Hadamard spaces, namely, in manifolds of nonpositive sectional curvature ( |19i [20l 
[23]). and the Hilbert ball ([16] and the references therein). On the other hand 
this theory can be partially extended into more general metric spaces and plays an 
important role in optimal transport theory, PDEs, and probability [T|. For another 
viewpoint; see [2]- 

To demonstrate that the gradient flow theory in Hadamard spaces applies in 
various situations, we now present a number of natural examples of convex lsc 
functions on an Hadamard space (H,d). 



Example 1.5 (Indicator functions). Let K C H be a convex set. Define the indi- 
cator function of K by 

. , Jo, if x e k, 

Then lk is a convex function, and it is lsc if and only if K is closed. 
Example 1.6 (Distance functions). The function 
(5) x h-> d (x, x ) , x eH, 

where xo is a fixed point of "H, is convex and continuous. The function d(-, z) p for 
p > 1 is strictly convex. More generally, the distance function to a closed convex 
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subset C CH, denned as 

dc{x) := inf d(x,c), x G H, 
is convex and 1-Lipschitz [8j p. 178]. 

Example 1.7 (Displacement functions). Let T : % — > H be an isometry. The 
displacement function of T is the function 8t ■ Td. — > [0, oo) defined by 

5t{x) '■= d(x,Tx), x G T-L. 

It is convex and Lipschitz p.229]. 

Example 1.8 (Busemann functions). Let c : [0,oo) —> TL be a geodesic ray. The 
function b c : H — > M defined by 

b c (a:) := lim [d (x, c(t)) - t] , x G H, 

t— »oo 

is called the Busemann function associated to the ray c. Busemann functions are 
convex and 1-Lipschitz. Concrete examples of Busemann functions are given in [8j 
p. 273]. Another explicit example of a Busemann function in the Hadamard space 
of positive definite n x n matrices with real entries can be found in Proposi- 
tion 10.69]. The sublevel sets of Busemann functions are called horoballs and carry 
a lot of information about the geometry of the space in question, see [5] and the 
references therein. 

Example 1.9 (Energy functional). The energy functional is another important 
instance of a convex function on an Hadamard space [Til HH H3 HZ]. Indeed, the 
energy functional is convex and lsc on a suitable Hadamard space of £ 2 -mappings. 
Minimizers of the energy functional are called harmonic maps, and are of an im- 
mense importance in both geometry and analysis. For a probabilistic approach to 
harmonic maps, see [2"51 121)1 12"8"] . 

2. Preliminaries 

We first recall basic notation and facts concerning Hadamard spaces. For further 
details on the subject, the reader is referred to |S]. 

2.1. Hadamard spaces. If a geodesic metric space (X,d) satisfies the following 
inequality 

(6) d (x, 7 (t)) 2 < (1 - t)d (x, 7 (0)) 2 + td (x, 7 (1)) 2 - t(l - t)d ( 7 (0), 7(1)) 2 , 

for any x € X, any geodesic 7 : [0,1] — > X, and any t s [0,1], we say it has 
nonpositive curvature (in the sense of Alexandrov), or that it is a CAT(0) space. 
A complete CAT(0) space is called an Hadamard space. 

The class of Hadamard spaces includes Hilbert spaces, R-trees, Euclidean Bruhat- 
Tits buildings, classical hyperbolic spaces, complete simply connected Riemannian 
manifolds of nonpositive sectional curvature, the Hilbert ball, CAT(0) complexes, 
and many other important spaces included in none of the above classes [5]. 

Let (7-1, d) be an Hadamard space. Having two points x,y E H, we denote the 
geodesic segment from x to y by [x,y\. We usually do not distinguish between a 
geodesic and its geodesic segment, as no confusion can arise. For a point z G [x,y], 
we write z = (1 — t)x + ty, where t = d(x, z)/d(x, y). 

For a function / : % — > (—00,00] we denote dom/ = {x E H : f(x) < 00} . If 
dom/ 7^ 0, we say / is proper. To avoid trivial situations we often assume this 
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property without explicit mentioning. As usually, the symbol dom / stands for the 
closure of dom/. A point x G TL is called a minimizer of / if f(x) = inf-^ /. 

2.2. Convex sets and functions on Hadamard spaces. Recall that a set C C 
TL is convex if x, y G C implies [x, y] C C. A function / : TL — > (— oo, oo] is convex 
provided / o 7 : [0,1] — >■ (— 00, 00] is convex for any geodesic 7 : [0, 1] — > TL. Note 
that the distance function dc is convex and continuous; see Example 11.61 

Proposition 2.1. Let (TL,d) be an Hadamard space and C C TL be closed and 
convex. Then: 

(i) For every x £ TL, there exists a unique point Pc(x) G C such that 

d (x, Pc{x)) = dc(x). 

(ii) If x S TL and y G C, then 

(7) d(x, yf > d (x, P c (x)f + d (y, P c {x)f 

(hi) The mapping Pq ■ TL — > C is nonexpansive and is called the metric pro- 
jection onto C. 

Proof. See [11 Proposition 2.4, p.176]. □ 

The following result comes from [9l Lemma 2.2]. We include its short proof for 
readers' convenience. 

Proposition 2.2. Let {TL,d) be an Hadamard space. If (C a ) a€ j is a nonincreasing 
family of bounded closed convex sets in TL. Then C\ aGl C a 7^ 0. 

Proof. Choose 1 £ H and denote its projection onto C a by x a — Pc a (x). Then 
(d(x, x a )) a is a nondecreasing net of nonnegative numbers, and hence has a limit 
If I = 0, then x G D a C a . If I > 0, then we claim that (x a ) is Cauchy. Indeed, 
denote x a p = \x a + ^xp and apply (O with t = 1/2 to obtain 

d(x, x afj ) 2 < ^d(x,x a ) 2 + ^d(x,Xf;) 2 - ^d(x a ,xp) 2 , 
which implies that (x a ) is Cauchy. The limit point clearly lies in |~) C a . □ 

As a consequence of Proposition 12.21 we obtain that convex lsc functions are 
locally bounded. 

Lemma 2.3. Let (TL,d) be an Hadamard, and f : TL — > (— 00,00] be a lsc convex 
function. Then f is bounded from below on bounded sets. 

Proof. Let C C TL be bounded, and without loss of generality assume that C is 
closed convex. If infc / = —00, then the sets Sn — {x G C : f(x) < —N} for N G N 
are all nonempty. Since all Sn are closed convex and bounded, Proposition 12.21 
yields a point z G Hat^n Clearly f(z) = —00, which is not possible. □ 

Let (TL, d) be an Hadamard space, / : TL — > (—00, 00] be convex lsc and x G TL. 
Then 

(8) ^d (J x x, vf - ^-d (x, vf + ^d (x, J x xf + / (J x x) < f(v), 



for each v G dom/. See [1] Theorem 4.1.2]. Furthermore, if x G dom / and we set 
x(t) = Stx, for t G [0, 00), then 

(9) \d (x(t), v) 2 - id (x(0), v) 2 + f (x(t)) < f(v), 
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for any t > and v G dom/. See [U (4.0.13)]. 

2.3. Weak convergence in Hadamard spaces. Here we recall the definition 
and basic properties of the weak convergence in Hadamard spaces. For a systematic 
account, the reader is referred to [4] Section 3]. 

We shall say that a bounded sequence (x n ) C T~L weakly converges to a point 
x G T-L if P-yX n — y x as n — y oo for any geodesic 7 : [0, 1] — >• H with 7(0) = x. 

If there is a subsequence (x n ) of such that a;„ p A z for some z G %, we say 
that z is a weafc cluster point of the sequence (a; n )- The following important result 
first appeared in [11] Theorem 2.1]. 

Proposition 2.4. i?ac/i bounded sequence has a weakly convergent subsequence, or 
in other words, each bounded sequence has a weak cluster point. 

Proof. See [IT] Theorem 2.1] or [4] Proposition 3.2]. □ 

Lemma 2.5. Let C C T-L a closed convex set. If (x n ) C C and G then 

xeC. 

Proof. See [H Lemma 3.7]. □ 

Definition 2.6. We shall say that a function f : T-L — > (— 00,00] is weakly lsc at 
a given point x G dom / if 

liminf f(x n ) > fix), 

n— y 00 

for each sequence x n A x. We say that f is weakly lsc if it is lsc at any x 6 dom/. 
Lemma 2.7. /// : % — > (—00,00] a lsc convex function, then it is weakly lsc. 
Proof. See [4] Lemma 3.9]. □ 

2.4. Nonexpansive mappings in Hadamard spaces. 

Definition 2.8. Let (T~L, d) be an Hadamard space and (F p ) >Q be a family of 
nonexpansive maps F p : % — > %. Fix A, p > and x G %. Then the map 

1 - 

(10) y^ — 1 x + -£- I F p y, yen, 

p p 

A 

is a contraction with Lipschitz constant ttx, and hence has a unique fixed point, 
which will be denoted J\^ p {x), see [24[ Lemma 3.1]. 

Theorem 2.9. Let (H,d) be an Hadamard space and f : H — > (—00,00] be of the 
form ©. We use NotationTnA 

(i) Let Jx.p be the mapping from Definition \2.8\ associated with the nonexpan- 
sive map F p — yjp o • • • o J-Jj . // we have 

J\(x) = lim J\. p {x), 



for any x G dom/ and A > 0, then, 

A ° • • • ° A) (x), 

n n J 

for any x G dom / and t > 0, and the convergence is uniform with respect 
to t on any compact subinterval of [0, 00). 
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(ii) Let now J\,p be the mapping from Definition \2.8\ associated with the non- 
expansive map F p — (S*p o P k ■ ■ ■ o Sp o Pi) . If we have 

J\(x) = lim J\.p(x), 



for any x 6 dom/ and A > 0, then, 

n n J 

for any x 6 dom / and t > 0, and the convergence is uniform with respect 
to t on any compact subinterval of [0, oo). 

Proof. See [Ml Theorem 3.13]. □ 
3. Proof of the main result 



In this section, we give the promised alternative proof of Theorem 11.41 For 
reader's convenience, we recall the statement here. 

Theorem 3.1 (Stojkovic). Let (H, d) be an Hadamard space and f : H — > (— oo, oo] 
be of the form ([3]). We use Notation \1.3[ Then, for any t > 0, and x € dom/, we 
have 

(\ ( n ) 
jio.-.oji (a?), 
n n / 

and, 

(13b) SAx) = lim [S\ oP k ---oS\ o P-A (x). 

The convergence in both (|13a[) and (|13b|) is uniform with respect to t on any compact 
subinterval of [0, oo). 

Proof. We mix various inequalities derived in |24] and employ the weak convergence 
as appropriate. 

We first show (113a|) . By Theorem 12. 91 it suffices to show J\^x —> J\x as t — > 0, 
where Ja, p now corresponds to the choice F p = (jj o • • • o JM . Put x (t) = Ja,*^ 
and JCj(t) = J(xj-i{t) for j = 1, . . . , k. By the definition of J\_ t we have 

(w) x ^ = ih^ x+ irh Xk ^ 

and consequently, 

(15) d(x,x k (t)) = i —^-d(x,x a (t)) , 
together with, 

(16) d(x (t),x k (t)) = jd(x,x (t)) . 

Applying ([8]) for each fj, with j = 1, . . . , fc, and summing the resulting inequal- 
ities up, we arrive at 

fc fc 

(17) 2tf(v) >2tJ2 fj Mt))+d (x k (t), v) 2 ~d(x (t), vf + Y, d fo-ift), ^(t)) 2 , 
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for every v G dom /. The inequality (|6]) yields 



-d(v,x k (t)) 2 > d(v,x (t)) 2 - 1 ,, d(v,x) 



1 + X/t v ' ^ " '--v>-"v» 1 + AA 

(18) + j. X/t 2 d(x,x k (t)f. 

(1 + \/t) 

Combining this inequality with (|15p and (|T71) gives after some elementary calcula- 
tions that 

k 

(19) 2Xf(v) >2X^2 fj (*j(*)) + d (iso(*), z) 2 + d (*<>(*), «) 2 - d(x, vf . 

3=1 

for every w E dom /. 

Fix now a sequence t n — > 0. For every j = 0, 1, . . . , k, the sequence (xj (t n )) n is 
bounded due to [24l Proposition 4.2]. Then Lemma [2.31 yields that the sequence 
(fj ( x j (tn))) n is bounded from below and the inequality (|19p implies that it is also 
bounded from above. 

Consider (fTTf with £„ and take the limit n — > oo to obtain via (|16p that 

fe 

(20) lim V d (xj-i (tn) , xj (t n )f = 0. 

71— >OC * * 

3=1 

Let z 6 ?{ be a weak cluster point of xq (t n ) and xq (tn p ) be a sequence weakly 
converging to z. Recall that the existence of a weak cluster point was guaranteed by 
Proposition 12.41 By (j20)) . also Xj (t np ) weakly converges to z, for all j = 1, . . . , k. 



Consequently, z E dom / due to Lemma \2. 5 1 

Consider next the inequality (fT9|) with t np and take the limit p — » oo. Since the 
functions /j, with j = 1, . . . , fc, are weakly lsc by Lemma 12.71 we obtain z = J\x. 
In particular, z E dom/. Since z was an arbitrary weak cluster point of xo (t n ) , we 
get Xo (t n ) —> J\x. Applying once again (|19|) with v = Jax, gives xq (t n ) — > J\x as 
n — > oo. This finishes the proof of (|13a[) . 

Next we show (|13b|) . By Theorem 12.91 it suffices to show J\.tx — > J\x as t — >• 0, 
where J\ tP now corresponds to the choice F p = (S p o P k ■ ■ ■ o S p o P x ) . 

Put x (t) = J\jx and Xj(t) = o Pj\ Xj_i(t) for j = 1, . . . , N. Since we again 



have pTj) . the same arguments as above yield that (|T5|) and also (fl8|) hold true. 

Applying ([9]) with /j , for j = 1 , . . . , N, and summing the resulting inequalities 
up gives 

(21) f(v) > \d (x k (t),v) 2 - Id (x (t), v) 2 + ^ /, (^-(t)) , 

3=1 

for any v E dom/. Combining this inequality with (1151) and (|18[) gives after some 
elementary calculations that (TT9")) holds. 

Fix now a sequence t n —t 0. For every j = 0, 1, . . . , k, the sequence (xj (t n )) n is 
bounded due to [24J Proposition 4.2]. Then Lemma [231 yields that the sequence 
(fj ( x j is bounded from below and the inequality (|T9| implies that it is also 

bounded from above. 

The inequality © implies that 

d(x 3 (t n ),v) 2 ~d(PjXj-i (t n ) ,v) 2 + 2t n fj (xj (<„)) < 2t n fj(v), 
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for each j = 1, . . . , k and v G dom fj. Take the limit n — > oo to obtain 
(22) limsup d(xj (t n ) ,v) 2 — d(PjXj^i (t n ) ,v) 2 < 0, 



for each j = 1, . . . , k and i> G dom/j, and even u G dom/j. In combination with 
the fact that 

d (xj (t n ) , v) 2 — d (xq (t n ) , v) 2 — y 0, as n — >• oo, 



for each j = 1, . . . ,k and v G dom/, contained in [Ml (4.20)], we obtain 



lim sup 



d (Xj-i (t„) , w) 2 - d (i n ) , w) 2 



<0, 



for each j = 1, . . . , k and w G dom /. The inequality ([7]) now reads 

d (PjXj^t (t n ) , v) 2 + d (xj_i (t n ) , PjXj^i (t n )) 2 < d (xj^i (t n ) , v) 2 



for any w G dom /-,-. Therefore 

(23) limsupc?(a; : ,_i (t n ) ,PjXj-i (t n )) 2 < 0, 

n— >oo 

for any j = l,...,k. 

Let z £ H be a weak cluster point so (t n ) and (tn p ) be a sequence weakly 
converging to z. Again, the existence of a weak cluster point was guaranteed by 
Proposition 12.41 By (|2"3"|) we have z G dom /i . Apply now the inequality ([22|) with 
w = PjXj-i (t n ) and (|23p to obtain xi (t np ) — > -2- By the same argument then also 
z G dom/ and Xj (t np ) weakly converges to z, for j — 2, . . . , k. Consider next the 
inequality (|19j) with v = J\x and t = t 7lp and take the limit p — > oo. Since the 
functions /j, with j = 1, . . . , fc, are weakly lsc by Lemma \2.7\ we obtain z = J\x. 
In particular, z G dom/. Since z was an arbitrary weak cluster point of xo {t n ) , 
we get xo (t n ) A J\x. Applying once again (|19|) with v = J\x and t — t n gives 
2^0 (t n ) Jai as n-> oo. This finishes the proof of (|13b[) . □ 
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